Conversation

Collaborator

@nammn nammn commented Sep 1, 2025

Summary

This pull request introduces Helm chart unit tests to the CI pipeline and ensures consistent naming for ClusterRole and ClusterRoleBinding resources in the Helm chart. The most important changes are grouped below:

Helm Chart Testing Integration:

  • Added a helm-tests target to the Makefile to run Helm chart unit tests using the helm-unittest plugin. The target installs the plugin if necessary and runs tests in the helm_chart directory.

  • Created a test_helm_unit function in .evergreen-functions.yml to execute the new Helm unit tests as part of CI.

  • Added a unit_tests_helm task to .evergreen.yml and included it in the unit_tests_task_group so the Helm unit tests run alongside the other unit tests in the CI pipeline (a sketch of this wiring follows).
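
A minimal sketch of how this CI wiring could look, assuming the usual Evergreen layout; the working directory and command parameters are illustrative assumptions, not taken from this PR:

```yaml
# .evergreen-functions.yml (sketch): the test_helm_unit function invokes the new
# Makefile target. The working_dir value is a placeholder assumption.
functions:
  test_helm_unit:
    - command: subprocess.exec
      type: test
      params:
        working_dir: src
        binary: make
        args: ["helm-tests"]

# .evergreen.yml (sketch): the unit_tests_helm task simply calls that function.
tasks:
  - name: unit_tests_helm
    commands:
      - func: test_helm_unit
```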
Helm Chart Improvements and Testing:

  • Updated operator-roles-webhook.yaml to dynamically generate consistent names for ClusterRole and ClusterRoleBinding resources based on the operator name and namespace, preventing naming conflicts across multiple installations.

  • Added a new Helm unit test suite (webhook_clusterrole_test.yaml) to verify that ClusterRole and ClusterRoleBinding names are consistent and unique per installation (a sketch follows).
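
A sketch of what such a helm-unittest suite could assert, assuming the template lives at templates/operator-roles-webhook.yaml and that the namespace helper resolves to the release namespace; the release and values shown are placeholders:

```yaml
# webhook_clusterrole_test.yaml (sketch, not the actual suite from this PR)
suite: webhook ClusterRole and ClusterRoleBinding names
templates:
  - templates/operator-roles-webhook.yaml   # assumed path
release:
  name: mck
  namespace: my-namespace
tests:
  - it: generates per-installation names from operator name and namespace
    set:
      operator:
        name: mongodb-kubernetes-operator
        webhook:
          registerConfiguration: true
          installClusterRole: true
    asserts:
      - equal:
          path: metadata.name
          value: mongodb-kubernetes-operator-my-namespace-webhook-cr
        documentSelector:
          path: kind
          value: ClusterRole
      - equal:
          path: metadata.name
          value: mongodb-kubernetes-operator-my-namespace-webhook-crb
        documentSelector:
          path: kind
          value: ClusterRoleBinding
```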

Example

The new names:

  • ClusterRoleBinding: <name>-<ns>-webhook-crb
  • ClusterRole: <name>-<ns>-webhook-cr

The old names:

  • ClusterRoleBinding: <name>-<ns>-webhook
  • ClusterRole: mongodb-kubernetes-operator-mongodb-webhook
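
For reference, these names come from template variables like the ones below; the first variable is taken verbatim from the diff in this PR, while the ClusterRoleBinding variable is an assumed mirror of it:

```yaml
{{- /* Per-installation names derived from the operator name and namespace. */}}
{{- $webhookClusterRoleName := printf "%s-%s-webhook-cr" .Values.operator.name (include "mongodb-kubernetes-operator.namespace" .) }}
{{- /* Assumed analogue for the binding; the actual variable name may differ. */}}
{{- $webhookClusterRoleBindingName := printf "%s-%s-webhook-crb" .Values.operator.name (include "mongodb-kubernetes-operator.namespace" .) }}
```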

Proof of Work

  • Green CI
  • Helm chart unit test runs as part of the unit tests: Link (IMO that unit test is more documentation than actual testing; it's nice to have a view of how those names end up being generated)

Checklist

  • Have you linked a Jira ticket and/or is the ticket in the title?
  • Have you checked whether your Jira ticket requires DOCSP changes?
  • Have you added a changelog file?

@nammn nammn added the skip-changelog label Sep 1, 2025

github-actions bot commented Sep 1, 2025

⚠️ (this preview might not be accurate if the PR is not rebased on the current master branch)

MCK 1.3.0 Release Notes

New Features

Multi-Architecture Support

We've added comprehensive multi-architecture support for the Kubernetes operator. This enhancement enables deployment on IBM Power (ppc64le) and IBM Z (s390x) architectures alongside the existing x86_64 support. Core images (operator, agent, init containers, database, readiness probe) now support multiple architectures. We do not add IBM and ARM support for Ops Manager and the init-ops-manager image.

  • MongoDB Agent images have been migrated to a new container repository: quay.io/mongodb/mongodb-agent.
    • The agents in the new repository support the x86_64, ARM64, s390x, and ppc64le architectures. More can be read in the public docs.
    • Operators running >= MCK 1.3.0 with static containers cannot use the agent images from the old container repository quay.io/mongodb/mongodb-agent-ubi.
  • quay.io/mongodb/mongodb-agent-ubi should no longer be used; it remains available only for backwards compatibility.

Bug Fixes

  • This change fixes the complex and difficult-to-maintain architecture for StatefulSet containers, which relied on an "agent matrix" to map operator and agent versions and therefore produced a very large number of images.
  • We solve this by shifting to a three-container setup. The new design eliminates the need for the operator-version/agent-version matrix by adding one additional container that holds all required binaries. This architecture mirrors what we already do with the mongodb-database container (see the illustrative sketch after this list).
  • Fixed an issue where the readiness probe reported the node as ready even when its authentication mechanism was not in sync with the other nodes, potentially causing premature restarts.
  • Fixed an issue where the MongoDB Agents did not adhere to the NO_PROXY environment variable configured on the operator.
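
Purely for illustration, a pod spec fragment for the three-container setup could look like the following; the container and image names here are hypothetical except for the agent repository mentioned above:

```yaml
# Illustrative sketch only; the real container and image names come from the
# operator, not from this PR.
spec:
  containers:
    - name: mongodb-agent             # runs the MongoDB Agent process
      image: quay.io/mongodb/mongodb-agent:<agent-version>
    - name: agent-binaries            # hypothetical: ships the required version-specific
      image: quay.io/mongodb/mongodb-agent:<agent-version>   # binaries, replacing the agent matrix
    - name: mongodb-database          # existing database container referenced above
      image: <database-image>         # placeholder
```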

Other Changes

  • Optional permissions for PersistentVolumeClaim resources have been moved to a separate role. When managing the operator with Helm, permissions for PersistentVolumeClaim resources can be disabled by setting the operator.enablePVCResize value to false (true by default). Previously these permissions were part of the primary operator role; with this change they live in a separate role (see the values sketch after this list).
  • The subresourceEnabled Helm value was removed. This setting used to be true by default and made it possible to exclude subresource permissions from the operator role by setting it to false. We are removing this configuration option, so the operator roles always include subresource permissions. The setting was introduced as a temporary workaround for this OpenShift issue; the issue has since been resolved and the setting is no longer needed.
  • We have deliberately not published the container images for Ops Manager versions 7.0.16, 8.0.8, 8.0.9, and 8.0.10 due to a bug in Ops Manager that prevents MCK customers from upgrading their Ops Manager deployments to those versions.
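
For example, a minimal Helm values snippet for the PVC toggle mentioned above (the key name is as given in the notes; the nesting is an assumption):

```yaml
# values.yaml (sketch): disable the separate role that grants
# PersistentVolumeClaim resize permissions (enabled by default).
operator:
  enablePVCResize: false
```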

@nammn nammn marked this pull request as ready for review September 2, 2025 09:49
@nammn nammn requested a review from a team as a code owner September 2, 2025 09:49
@@ -1,12 +1,13 @@

{{/* This cluster role and binding is necessary to allow the operator to automatically register ValidatingWebhookConfiguration. */}}
{{- if and .Values.operator.webhook.registerConfiguration .Values.operator.webhook.installClusterRole }}
{{- if not (lookup "rbac.authorization.k8s.io/v1" "ClusterRole" "" "mongodb-kubernetes-operator-mongodb-webhook") }}
{{- $webhookClusterRoleName := printf "%s-%s-webhook-cr" .Values.operator.name (include "mongodb-kubernetes-operator.namespace" .) }}
Collaborator Author

I've changed both names, because one was dynamic and one was not, which caused upgrade problems. Now both follow the new, consistent naming scheme.

@nammn nammn removed the skip-changelog label Sep 2, 2025
Contributor

Does this test verify that subsequent upgrades of the chart do not break the RBAC?

Collaborator Author

No, we only have e2e tests that verify latest (1.2.0) -> current (code build), and those are passing.

We don't have current -> current, but I don't see how that could fail. Do you think we should test this case in particular?

Contributor

Ideally we should have a test in our CI that upgrades from the released Helm chart version to the current local version to exercise this.

Collaborator Author
@nammn nammn Sep 2, 2025

We only have e2e tests that verify latest (1.2.0) -> current (code build), and those are passing.

Isn't that exactly that, or are we talking about different things?

Collaborator Author

It's operator_upgrade_ops_manager; we have a suite of those.
